Monitoring emerging variants

As part of the UK pandemic response, we develop methods to identify and monitor emerging variants of interest. Your job today is to look at clusters of interest, including specific lineages or lineage and mutation combinations, and report on growth rates. We wil look at outputs from the scanning tool designed for Public Health England (PHE) as part of the SARS-CoV-2 pandemic response to monitor emerging variants of interest or concern.

PHE situation report

Imagine the date is the 28th of May, 2021. PHE has requested a situation report on the top 5 growth clusters of interest, on the growth rates of B.1.617.2 in the country as this is a new VOC, as well as clusters associated with mutations S:L452R and S:K417N. This report should be about three pages in length, and include a few figures or tables from the scanning tool as part of the report. Factors to consider when writing your report:

  1. Which lineages are the highest growth rate lineages?
  2. Where are these lineages occuring in the population? (Use google maps to look up UK locations if uncertain)
  3. How does their growth look over time? What about their frequency? How many LTLAs do we see them in?
  4. How do age distributions look in these lineages of interest?
  5. What about vaccinations? What information do we have on vaccinations?
  6. Do we see any mutations of interest or concern (to be discussed further in the mutations section below)?
  7. How might current non-pharmaceutical interventions impact growth rates (remember the date is 28/05/2021 so the so called “freedom day” has not occured yet)?

Additionally, remember that we are only including P1 (lighthouse) samples in our scanning tool and outputs. This is always good to mention. Why? Because sampling from certain populations can lead to sample bias in our results. Discuss with your classmates how we might bias our results if we included hospital cases in our scanner tool.

N.B You will include this report as part of the summative SuS Revolutions in Biomedicine portfolio. You are not expected to complete this report in the practical period, so please take the time to discuss outputs with your classmates as you work through the practical together.

Lineage nomenclature

Table 1. provides a reference of lineages of interest or concern that you should consider looking at in the scanner outputs. Please note, all scanner outputs utlise the pangolin lineage nomenclature, but feel free to call these be either WHO, pangolin lineage nomenclature or VOC/VUI designation in your report write up.

 

 

Growth rates from scanner output

We first look at growth rates for all clusters in the most recent scanner run, with date of last sample from the cluster of interest on the x axis. Although there are quite a few clusters on the image, notice that we can toggle clusters with specific lineages on and off the figure by clicking on a lineage in the legend. Although clusters can (and generally do) contain more than one lineage, we identify which lineage is most prominent in a cluster and label a cluster as “lineage +”.

Questions to consider when looking at growth rates between clusters:

Which clusters show the highest growth rates? What lineages do they include? How recent are teh sample dates? Do we see clusters with higher growth rates but less recent samples? Specifically, what do we see happening with B.1.1.7 and B.1.617.2? B.1.1.7 has been the dominant lineage in circulation since the start of the pandemic. However, B.1.617.2 has recently shown potential transmission advantage. How large (number of sequences) are the clusters for B.1.617.2, B.1.617.2+ and B.1.1.7? Where do we see the largest B.1.1.7 clusters?

This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.

When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:

summary(cars)
##      speed           dist       
##  Min.   : 4.0   Min.   :  2.00  
##  1st Qu.:12.0   1st Qu.: 26.00  
##  Median :15.0   Median : 36.00  
##  Mean   :15.4   Mean   : 42.98  
##  3rd Qu.:19.0   3rd Qu.: 56.00  
##  Max.   :25.0   Max.   :120.00

Including Plots

You can also embed plots, for example:

Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.